Search CORE

104 research outputs found

Detecting highly overlapping community structure by greedy clique expansion

Author: Hurley Neil
Lee Conrad
McDaid Aaron
Reid Fergal
Publication venue
Publication date: 01/01/2010
Field of study

In complex networks it is common for each node to belong to several communities, implying a highly overlapping community structure. Recent advances in benchmarking indicate that existing community assignment algorithms that are capable of detecting overlapping communities perform well only when the extent of community overlap is kept to modest levels. To overcome this limitation, we introduce a new community assignment algorithm called Greedy Clique Expansion (GCE). The algorithm identifies distinct cliques as seeds and expands these seeds by greedily optimizing a local fitness function. We perform extensive benchmarks on synthetic data to demonstrate that GCE's good performance is robust across diverse graph topologies. Significantly, GCE is the only algorithm to perform well on these synthetic graphs, in which every node belongs to multiple communities. Furthermore, when put to the task of identifying functional modules in protein interaction data, and college dorm assignments in Facebook friendship data, we find that GCE performs competitively.Comment: 10 pages, 7 Figures. Implementation source and binaries available at http://sites.google.com/site/greedycliqueexpansion

arXiv.org e-Print Archive

CiteSeerX

Research Repository UCD

Irish Universities

Seeding for pervasively overlapping communities

Author: A. Mislove
Aaron McDaid
C. Lee
Conrad Lee
F. Luo
Fergal Reid
J. Baumes
Neil Hurley
R. Andersen
Publication venue: 'American Physical Society (APS)'
Publication date: 29/04/2011
Field of study

In some social and biological networks, the majority of nodes belong to multiple communities. It has recently been shown that a number of the algorithms that are designed to detect overlapping communities do not perform well in such highly overlapping settings. Here, we consider one class of these algorithms, those which optimize a local fitness measure, typically by using a greedy heuristic to expand a seed into a community. We perform synthetic benchmarks which indicate that an appropriate seeding strategy becomes increasingly important as the extent of community overlap increases. We find that distinct cliques provide the best seeds. We find further support for this seeding strategy with benchmarks on a Facebook network and the yeast interactome.Comment: 8 Page

arXiv.org e-Print Archive

Crossref

Quantifying the extent to which index event biases influence large genetic association studies

Author: Aaron McDaid
Andrew R. Wood
Anna Murray
Archie Campbell
Caroline Hayward
Colin Palmer
Ewan R. Pearson
Gage
Hanieh Yaghootkar
James S. Pankow
Jessica Tyrrell
Katherine S. Ruth
Larger
Louise Donnelly
Lynne J. Hocking
Marcus A. Tuke
Michael N. Weedon
Michael P. Bancks
Morris
Patricia B. Munroe
Rachel M. Freathy
Robin Beaumont
Sam E. Jones
Timothy M. Frayling
Zoltán Kutalik
Publication venue: 'Oxford University Press (OUP)'
Publication date: 30/12/2016
Field of study

This is the author accepted manuscript. The final version is available from the publisher via the DOI in this record.As genetic association studies increase in size to 100,000s of individuals, subtle biases may influence conclusions. One possible bias is "index event bias" (IEB) that appears due to the stratification by, or enrichment for, disease status when testing associations between genetic variants and a disease-associated trait. We aimed to test the extent to which IEB influences some known trait associations in a range of study designs and provide a statistical framework for assessing future associations. Analysing data from 113,203 non-diabetic UK Biobank participants, we observed three (near TCF7L2, CDKN2AB and CDKAL1) overestimated (BMI-decreasing) and one (near MTNR1B) underestimated (BMI-increasing) associations among 11 type 2 diabetes risk alleles (at P 500,000 if the prevalence of those diseases differs by > 10% from the background population. In conclusion, IEB may result in false positive or negative genetic associations in very large studies stratified or strongly enriched for/against disease cases.H.Y., A.R.W. and T.M.F. are supported by the European Research Council grant: 323195; SZ-245 50371-GLUCOSEGENES-FP7-IDEAS-ERC. S.E.J. is funded by the Medical Research Council (grant: MR/M005070/1). M.A.T., M.N.W. and A.M. are supported by the Wellcome Trust Institutional Strategic Support Award (WT097835MF). R.M.F. is a Sir Henry Dale Fellow (Wellcome Trust and Royal Society grant: 104150/Z/14/Z). R.B. is funded by the Wellcome Trust and Royal Society grant: 104150/Z/14/Z. J.T. is funded by a Diabetes Research and Wellness Foundation Fellowship. Z.K. received financial support from the Leenaards Foundation, the Swiss Institute of Bioinformatics and the Swiss National Science Foundation (31003A-143914) and SystemsX.ch (39). The work of M.P.B was supported by the National Heart, Lung, And Blood Institute of the National Institutes of Health under Award no. T32HL007779. Generation Scotland received core support from the Chief Scientist Office of the Scottish Government Health Directorates [CZD/16/6] and the Scottish Funding Council [HR03006]. E.R.P. holds a WT New investigator award 102820/Z/13/Z

Crossref

Serveur académique lausannois

Edinburgh Research Explorer

Open Research Exeter

University of Dundee Online Publications

Queen Mary Research Online

CNV-association meta-analysis in 191,161 European adults reveals new loci associated with anthropometric traits

Author: Afaq Saima
Ang Wei Q.
Beaumont Robin N.
Beckmann Jacques S.
Bochud Murielle
Boers Harmen
Borodulin Katja
Bottinger Erwin P.
Chambers John C.
Christensen Kaare
Cusi Daniele
Deelen Patrick
Deloukas Panos
Eriksson Johan G.
Feenstra Bjarke
Feitosa Mary F.
Franke Lude
Frayling Timothy M.
Freathy Rachel M.
Geller Frank
Hansen Thomas F.
Havulinna Aki S.
Hayward Caroline
Heid Iris M.
Hirschhorn Joel
Jacquemont Sébastien
Jones Samuel E.
Kooner Jaspal S.
Koponen Päivikki
Koskinen Seppo
Kriebel Jennifer
Kristiansson Kati
Kutalik Zoltán
Kähönen Mika
Lahti Jari
Lehtimäki Terho
Lenzini Petra
Lettre Guillaume
Liu Xueping
Lokki Marja Liisa
Loos Ruth J.F.
Lundqvist Annamari
Macé Aurélien
Mangino Massimo
Martin Nicholas G.
Mattsson Hannele
McDaid Aaron F.
Medland Sarah E.
Meitinger Thomas
Melbye Mads
Metspalu Andres
Montgomery Grant W.
Murray Anna
Mägi Reedik
Männik Katrin
Männistö Satu
Müller-Nurasyid Martina
Newman Anne B.
Nieminen Markku
Nolte Ilja M.
Nyholt Dale R.
Nõukas Margit
Oldehinkel Albertine J.
Palotie Aarno
Pennell Craig
Perls Thomas T.
Perola Markus
Peters Annette
Porcu Eleonora
Porteous David
Province Michael A.
Raitakari Olli T.
Reymond Alexandre
Rissanen Harri
Rivadeneira Fernando
Rosengren Anders
Ruth Katherine S.
Rüeger Sina
Salomaa Veikko
Salvi Erika
Sapkota Yadav
Schick Ursula
Schupf Nicole
Shrine Nick
Sinisalo Juha
Snieder Harold
Sparsø Thomas
Spector Timothy D.
Strachan David P.
Swertz Morris A.
Tobin Martin D.
Tuke Marcus A.
Tyrrell Jessica
Van Der Most Peter J.
Vartiainen Erkki
Venturini Cristina
Viikari Jorma S.
Wain Louise V.
Weedon Michael N.
Werge Thomas
Winkler Thomas W.
Wojczynski Mary K.
Wood Andrew R.
Wray Naomi R.
Yaghootkar Hanieh
Zhang Weihua
Publication venue
Publication date: 01/01/2017
Field of study

Funding Information: This research has been conducted using the UK Biobank Resource. This research has been conducted using the Danish National Biobank resource. The authors are grateful to the Raine Study participants and their families, and to the Raine Study research staff for cohort co-ordination and data collection. QIMR is grateful to the twins and their families for their generous participation in these studies. We would like to thank staff at the Queensland Institute of Medical Research: Anjali Henders, Dixie Statham, Lisa Bowdler, Ann Eldridge, and Marlene Grace for sample collection, processing and genotyping, Scott Gordon, Brian McEvoy, Belinda Cornes and Beben Benyamin for data QC and preparation, and David Smyth and Harry Beeby for IT support. HBCS Acknowledgements: We thank all study participants as well as everybody involved in the Helsinki Birth Cohort Study. Helsinki Birth Cohort Study has been supported by grants from the Academy of Finland, the Finnish Diabetes Research Society, Folkhälsan Research Foundation, Novo Nordisk Foundation, Finska Läkaresällskapet, Juho Vainio Foundation, Signe and Ane Gyllenberg Foundation, University of Helsinki, Ministry of Education, Ahokas Foundation, Emil Aaltonen Foundation. Finrisk study is grateful for the THL DNA laboratory for its skillful work to produce the DNA samples used in this study and thanks the Sanger Institute and FIMM genotyping facilities for genotyping the samples. We thank the MOLGENIS team and Genomics Coordination Center of the University Medical Center Groningen for software development and data management, in particular Marieke Bijlsma and Edith Adriaanse. This work was supported by the Leenards Foundation (to Z.K.), the Swiss National Science Foundation (31003A_169929 to Z.K., Sinergia grant CRSII33-133044 to AR), Simons Foundation (SFARI274424 to AR) and SystemsX.ch (51RTP0_151019 to Z.K.). A.R.W., H.Y. and T.M.F. are supported by the European Research Council grant: 323195:SZ-245. M.A.T., M.N.W. and An.M. are supported by the Wellcome Trust Institutional Strategic Support Award (WT097835MF). For full funding information of all participating cohorts see Supplementary Note 2. Publisher Copyright: © 2017 The Author(s).There are few examples of robust associations between rare copy number variants (CNVs) and complex continuous human traits. Here we present a large-scale CNV association meta-analysis on anthropometric traits in up to 191,161 adult samples from 26 cohorts. The study reveals five CNV associations at 1q21.1, 3q29, 7q11.23, 11p14.2, and 18q21.32 and confirms two known loci at 16p11.2 and 22q11.21, implicating at least one anthropometric trait. The discovered CNVs are recurrent and rare (0.01-0.2%), with large effects on height (> 2.4 cm), weight ( 5 kg), and body mass index (BMI) (> 3.5 kg/m(2)). Burden analysis shows a 0.41 cm decrease in height, a 0.003 increase in waist-to-hip ratio and increase in BMI by 0.14 kg/m2 for each Mb of total deletion burden (P = 2.5 x 10(-10), 6.0 x 10(-5), and 2.9 x 10(-3)). Our study provides evidence that the same genes (e.g., MC4R, FIBIN, and FMO5) harbor both common and rare variants affecting body size and that anthropometric traits share genetic loci with developmental and psychiatric disorders.Peer reviewe